Abstract: We show that in preferential attachment models with power-law exponent
$\tau \in (2, 3)$ the distance between randomly chosen vertices in the giant component is
asymptotically equal to $(4 + o(1)) \frac{\log\log N}{-\log(\tau - 2)}$, where $N$ denotes
the number of nodes. This is twice the value obtained for several types of configuration
models with the same power-law exponent. The extra factor reveals the different structure
of typical shortest paths in preferential attachment graphs.

One of the central observations in the theory of scale-free random networks is that in
the case of power-law exponents $\tau \in (2, 3)$ networks are ultrasmall, which means that the
distance of two randomly chosen nodes in the giant component of a graph with $N$ vertices is of
asymptotic order $\log\log N$. The first analytical, but mathematically nonrigorous, evidence
for this general phenomenon can be found, for example, in Cohen and Havlin [CH03] or
Dorogovtsev et al. [DMS03], and there are also some early papers with rigorous results for
specific network models, in particular the work of Reittu and Norros [RN02] and the work
of Chung and Lu [CL03].

In the present paper we refine this observation and identify graph distances including 
constant factors. Our main result is a universal technique for proving lower bounds for 
typical distances, which in a wide range of examples matches the best upper bounds known
from the recent literature. The result is presented in the form of two theorems, which reveal

that ultrasmall networks can be divided into two different universality classes: For the class of 
ultrasmall preferential attachment models the typical distances turn out to be twice as large 
as for models in the class of configuration models. This difference corresponds to different 
structures of typical shortest paths in the network. We show that the two classes can be 
easily identified from the form of the attachment probability densities in the networks. We 
remark here that our work is focused on typical distances in networks, as results on diameters 
tend to be model dependent and universality results are not to be expected. 

At least informally, we have some structural insight into typical shortest paths in ultrasmall 
networks, see for example Norros and Reittu [NR08]. For the class of configuration models it
turns out that typical vertices in the giant component can be connected with a few steps to 
a core of the network. Within this core there is a hierarchy of layers of nodes with increasing 
connectivity and at the top a small inner core of highly connected nodes with very small 
diameter. A typical shortest path inside the core runs from one layer to the next until the 
inner core is reached, and then climbing down again until a vertex in the lowest layer of the 
core is again connected to a typical vertex. 

A high degree of a vertex increases its connectivity to any other vertex, and hence the
layers can be identified by vertex degrees. Very roughly speaking the $j$th layer consists of
vertices with degree $k_j$ where

\[ \log k_j \approx (\tau - 2)^{-j}, \]

and there are about

\[ \frac{\log\log N}{-\log(\tau - 2)} \]

layers. The graph distance of two randomly chosen vertices in the giant component is
therefore

\[ (2 + o(1)) \frac{\log\log N}{-\log(\tau - 2)}. \]

These asymptotics are rigorously confirmed for two variants of an inhomogeneous random
graph model, by Chung and Lu [CL03] and Norros and Reittu [NR06], and for a model with
fixed degree sequence by van der Hofstad et al. in [HHZ07]. See also van der Hofstad [Hof10]
for a summary of various results with detailed proofs. In general upper bounds on the
distances can be obtained by verifying the above strategy, while our Theorem 2 provides a
flexible (i.e. model-independent) approach to the lower bound.

For the more complex class of ultrasmall preferential attachment models existing results
are far less complete. Dommers et al. [DHH10] show that for various ultrasmall preferential
attachment models the typical distance of two vertices in the giant component is bounded
from above by

\[ (4 + o(1)) \frac{\log\log N}{-\log(\tau - 2)}. \]

A corresponding lower bound, and hence confirmation of the exact factor 4, is stated as
an interesting open problem by van der Hofstad and Hooghiemstra in [HH08, IV.B] and
again in [DHH10], see the remark following Theorem 1.7 and Section 1.2. Our main result,
Theorem 1, provides this bound and confirms, somewhat surprisingly, that the upper bound
is sharp. Besides the models given in [DHH10] we will also describe other examples of random
network models in the same universality class, in which Theorem 1 applies.

Loosely speaking, the shortest paths in the class of preferential attachment models can be 
described as follows: Again, inside a core of highly connected vertices paths run from bottom 
to top and back through a hierarchy of layers defined as before. However, by construction of 
the preferential attachment models a high degree of a vertex does not increase its connectivity 
to all vertices but only to those introduced late into the system (which are typically outside 
the core). Therefore a path cannot directly connect one layer to another in one step, but it 
requires two steps: The paths run from one layer to a young vertex and from there back into 
the next higher layer. The distance of two typical vertices is therefore increased by a factor 
of two. 
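
To get a feeling for the size of the two leading-order terms, the following minimal sketch evaluates $2 \log\log N / (-\log(\tau - 2))$ and $4 \log\log N / (-\log(\tau - 2))$ for a few network sizes. The function name `leading_order_distance` and the value $\tau = 2.5$ are illustrative assumptions, and the $o(1)$ correction is simply ignored.

```python
import math


def leading_order_distance(n_vertices: int, tau: float, factor: int) -> float:
    """Return factor * log log N / (-log(tau - 2)), ignoring the o(1) correction."""
    if not 2.0 < tau < 3.0:
        raise ValueError("the ultrasmall regime requires 2 < tau < 3")
    return factor * math.log(math.log(n_vertices)) / (-math.log(tau - 2.0))


if __name__ == "__main__":
    tau = 2.5  # illustrative exponent, not taken from the text
    for n in (10**4, 10**6, 10**9):
        cm = leading_order_distance(n, tau, factor=2)  # configuration type
        pa = leading_order_distance(n, tau, factor=4)  # preferential attachment type
        print(f"N={n:>10}  configuration: {cm:5.2f}  preferential attachment: {pa:5.2f}")
```

Because of the doubly logarithmic growth, both quantities stay small even for very large $N$; only their ratio of two is exact at this level of approximation.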

In the following section we formulate the precise results, consisting of two simple hypotheses
on a random network leading to the two different lower bound results, see Theorems 1
and 2. The section also contains a brief sketch of the proof technique and introduces the
notation used in the proofs. In Section 3 we then discuss several examples of networks in 
the two universality classes. In all these examples upper bounds can either be found in the 
literature or derived by simple modifications of these proofs. Section 4 is devoted to the 
proofs of our main results. 

2. Main results

A (dynamic) network model is a sequence of random graphs $(\mathcal{G}_N)_{N \in \mathbb{N}}$ with the set of
vertices of $\mathcal{G}_N$ given by $[N] := \{1, 2, \ldots, N\}$ and the set of unoriented edges of $\mathcal{G}_N$ given by a
random symmetric subset of $[N] \times [N]$. Occasionally we shall allow multiple edges between
the same pair of vertices, but this has no bearing on the connectivity problems discussed
here, and is for convenience only. We write $v \leftrightarrow w$ if the vertices $v, w$ are connected by an
edge in the graph $\mathcal{G}_N$. The graph distance is given by

\[ d_N(v, w) := \min\big\{ n \colon \exists\, v = v_0, v_1, \ldots, v_n = w \in [N] \text{ such that } v_{i-1} \leftrightarrow v_i \ \forall\, 1 \le i \le n \big\}. \]
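
For a concrete realisation of a graph, the distance $d_N(v, w)$ can be computed by breadth-first search. The sketch below assumes the graph is stored as a dictionary `adj` mapping each vertex to the set of its neighbours; this representation, the function name `graph_distance`, and the conventions of returning 0 for $v = w$ and `None` for vertices in different components are choices made here for illustration.

```python
from collections import deque


def graph_distance(adj, v, w):
    """Return the graph distance between v and w, or None if they are disconnected."""
    if v == w:
        return 0
    dist = {v: 0}
    queue = deque([v])
    while queue:
        u = queue.popleft()
        for x in adj.get(u, ()):
            if x not in dist:          # first visit gives the shortest distance
                dist[x] = dist[u] + 1
                if x == w:
                    return dist[x]
                queue.append(x)
    return None
```

Multiple edges and self-loops can be ignored when building `adj`, in line with the remark that they have no bearing on connectivity.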

The main aim of this paper is to provide techniques to find lower bounds on the typical 
distance, i.e. the asymptotic graph distance of two randomly chosen vertices in the graph $\mathcal{G}_N$.
Our first result is based on the following assumption.

Assumption PA($\gamma$):

There exists $\kappa$ such that, for all $N$ and pairwise distinct vertices $v_0, \ldots, v_\ell \in [N]$,

\[ \mathbb{P}\{v_0 \leftrightarrow v_1 \leftrightarrow \cdots \leftrightarrow v_\ell\} \le \prod_{k=1}^{\ell} \kappa\, (v_{k-1} \wedge v_k)^{-\gamma} (v_{k-1} \vee v_k)^{\gamma - 1}. \]

In preferential attachment models with power law exponent $\tau$, Assumption PA($\gamma$) is
typically satisfied for all $\gamma > (\tau - 1)^{-1}$. Hence we expect these networks to be ultrasmall if and
only if $\frac12 < \gamma < 1$. Theorem 1, our main result, gives a lower bound on the typical distance
in this case.



Theorem 1. Let $(\mathcal{G}_N)_{N \in \mathbb{N}}$ be a dynamic network model that satisfies Assumption PA($\gamma$) for
some $\gamma$ satisfying $\frac12 < \gamma < 1$. Then, for random vertices $V$ and $W$ chosen independently and
uniformly from $[N]$, we have

\[ d_N(V, W) \ge 4\, \frac{\log\log N}{\log\big(\frac{\gamma}{1-\gamma}\big)} + O(1) \]

with high probability.

Examples of network models, in which Theorem 1 can be applied, will be given as Examples 1-2
in Section 3. They comprise various preferential attachment models with power law
exponent $\tau \in (2, 3)$. In all these cases Assumption PA($\gamma$) is satisfied for all $\gamma > (\tau - 1)^{-1}$,
and the theorem implies that

\[ d_N(V, W) \ge (4 + o(1)) \frac{\log\log N}{-\log(\tau - 2)} \quad \text{with high probability as } N \to \infty. \]

Matching upper bounds are known from the literature. 
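
The passage from the bound in terms of $\gamma$ to the bound in terms of $\tau$ is a one-line computation, spelled out here for convenience. At the boundary value $\gamma = (\tau - 1)^{-1}$ one has

```latex
\[
  \frac{\gamma}{1 - \gamma}
  = \frac{1/(\tau - 1)}{(\tau - 2)/(\tau - 1)}
  = \frac{1}{\tau - 2},
  \qquad\text{so that}\qquad
  \log\Big(\frac{\gamma}{1 - \gamma}\Big) = -\log(\tau - 2).
\]
```

Since $\gamma \mapsto \log(\gamma/(1-\gamma))$ is continuous and increasing on $(\frac12, 1)$, applying the theorem with $\gamma$ arbitrarily close to $(\tau - 1)^{-1}$ from above yields the constant $4 + o(1)$ in front of $\log\log N / (-\log(\tau - 2))$.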

An approach similar to the above can be used to study lower bounds for the typical 
distance of ultrasmall configuration networks. In this class the connection probabilities look 
different and we have to formulate a different assumption. 
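
Assumption CM($\gamma$):

There exists $\kappa$ such that, for all $N$ and pairwise distinct vertices $v_0, \ldots, v_\ell \in [N]$,

\[ \mathbb{P}\{v_0 \leftrightarrow v_1 \leftrightarrow \cdots \leftrightarrow v_\ell\} \le \prod_{k=1}^{\ell} \kappa\, v_{k-1}^{-\gamma} v_k^{-\gamma} N^{2\gamma - 1}. \]

In configuration models with power law exponent $\tau$, Assumption CM($\gamma$) is typically
satisfied for all $\gamma > (\tau - 1)^{-1}$ and again we expect these networks to be ultrasmall if $\frac12 < \gamma < 1$.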

Theorem 2. Let $(\mathcal{G}_N)_{N \in \mathbb{N}}$ be a dynamic network model that satisfies Assumption CM($\gamma$)
for some $\gamma$ satisfying $\frac12 < \gamma < 1$. Then, for random vertices $V$ and $W$ chosen independently
and uniformly from $[N]$, we have

\[ d_N(V, W) \ge 2\, \frac{\log\log N}{\log\big(\frac{\gamma}{1-\gamma}\big)} + O(1), \quad \text{with high probability as } N \to \infty. \]

Examples of network models, in which Theorem 2 can be applied, will be given as Examples 3-5
in Section 3. They comprise a variety of configuration models with power law
exponent $\tau \in (2, 3)$. In all these cases Assumption CM($\gamma$) is satisfied for all $\gamma > (\tau - 1)^{-1}$,
and the theorem implies that

\[ d_N(V, W) \ge (2 + o(1)) \frac{\log\log N}{-\log(\tau - 2)} \quad \text{with high probability as } N \to \infty. \]

Again, in all examples matching upper bounds are known from the literature. 

The proof of both theorems is based on a constrained or truncated first order method, 
which we now briefly explain. We start with an explanation of the (unconstrained) first 
moment bound and its shortcomings. Let $v, w$ be distinct vertices of $\mathcal{G}_N$. Then, for $\delta \in \mathbb{N}$,

\[
\mathbb{P}\{d_N(v, w) \le 2\delta\}
= \mathbb{P}\Big( \bigcup_{k=1}^{2\delta} \bigcup_{(v_1, \ldots, v_{k-1})} \{v \leftrightarrow v_1 \leftrightarrow \cdots \leftrightarrow v_{k-1} \leftrightarrow w\} \Big)
\le \sum_{k=1}^{2\delta} \sum_{(v_1, \ldots, v_{k-1})} \prod_{j=1}^{k} p(v_{j-1}, v_j),
\]

where $(v_0, \ldots, v_k)$ is any collection of pairwise distinct vertices in $\mathcal{G}_N$ with $v_0 = v$ and $v_k = w$
and, for $m, n \in \mathbb{N}$,

\[
p(m, n) :=
\begin{cases}
\kappa\, (m \wedge n)^{-\gamma} (m \vee n)^{\gamma - 1} & \text{if PA($\gamma$) holds;} \\
\kappa\, m^{-\gamma} n^{-\gamma} N^{2\gamma - 1} & \text{if CM($\gamma$) holds.}
\end{cases}
\]

Note that one can assign each path $(v_0, \ldots, v_k)$ the weight

\[ p(v_0, \ldots, v_k) := \prod_{j=1}^{k} p(v_{j-1}, v_j), \tag{2.1} \]

and the upper bound is just the sum over the weights of all paths from $v$ to $w$ of length
no more than $2\delta$. The shortcoming of this bound is that the paths that contribute most to
the total weight are those that connect $v$, resp. $w$, quickly to vertices with extremely small
indices. Since these are typically not present in the network, such paths have to be removed
in order to get a reasonable estimate.
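
The effect described above can be seen numerically from the path weights (2.1). The following sketch implements the two edge weights $p(m, n)$ and the product along a path; the function names, the default $\kappa = 1$, and the example vertices are assumptions made purely for illustration.

```python
def edge_weight(m, n, gamma, kappa=1.0, model="PA", n_vertices=None):
    """Weight p(m, n) of a single edge under PA(gamma) or CM(gamma)."""
    if model == "PA":
        return kappa * min(m, n) ** (-gamma) * max(m, n) ** (gamma - 1.0)
    if model == "CM":
        if n_vertices is None:
            raise ValueError("CM weights depend on the network size N")
        return kappa * (m * n) ** (-gamma) * n_vertices ** (2.0 * gamma - 1.0)
    raise ValueError("model must be 'PA' or 'CM'")


def path_weight(path, gamma, **kwargs):
    """Product of the edge weights along a path, as in (2.1)."""
    weight = 1.0
    for a, b in zip(path, path[1:]):
        weight *= edge_weight(a, b, gamma, **kwargs)
    return weight


if __name__ == "__main__":
    # A path through the very old vertex 1 versus a path through vertex 400.
    print(path_weight([500, 1, 900], gamma=0.75))
    print(path_weight([500, 400, 900], gamma=0.75))
```

In this toy comparison the path through vertex 1 carries a weight several thousand times larger than the path through vertex 400, which is why paths reaching very small indices too early dominate the unconstrained bound.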
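
To this end we define a decreasing sequence $(\ell_k)_{k=0,\ldots,\delta}$ of positive integers and consider
a tuple of vertices $(v_0, \ldots, v_n)$ as admissible if $v_k \wedge v_{n-k} \ge \ell_k$ for all $k \in \{0, \ldots, \delta \wedge n\}$. We
denote by $A_k^{(v)}$ the event that there exists a path $v = v_0 \leftrightarrow \cdots \leftrightarrow v_k$ in the network such
that $v_0 \ge \ell_0, \ldots, v_{k-1} \ge \ell_{k-1}$, $v_k < \ell_k$, i.e. a path that traverses the threshold after exactly
$k$ steps. For fixed vertices $v, w \ge \ell_0$, the truncated first moment estimate is

\[
\mathbb{P}\{d_N(v, w) \le 2\delta\}
\le \sum_{k=1}^{\delta} \mathbb{P}(A_k^{(v)}) + \sum_{k=1}^{\delta} \mathbb{P}(A_k^{(w)})
+ \sum_{n=1}^{2\delta} \sum_{\substack{(v_0, \ldots, v_n) \\ \text{admissible}}} \mathbb{P}\{v_0 \leftrightarrow \cdots \leftrightarrow v_n\},
\tag{2.2}
\]

where the admissible paths in the last sum start with $v_0 = v$ and end with $v_n = w$. By
assumption,

\[ \mathbb{P}\{v_0 \leftrightarrow \cdots \leftrightarrow v_n\} \le p(v_0, \ldots, v_n) \]

so that for $v \ge \ell_0$ and $k = 1, \ldots, \delta$,

\[
\mathbb{P}(A_k^{(v)}) \le \sum_{v_1 = \ell_1}^{N} \cdots \sum_{v_{k-1} = \ell_{k-1}}^{N} \sum_{v_k = 1}^{\ell_k - 1} p(v, v_1, \ldots, v_k).
\tag{2.3}
\]

Given $\varepsilon > 0$ we choose $\ell_0 = \lceil \varepsilon N \rceil$ and $(\ell_j)_{j=0,\ldots,k}$ decreasing fast enough so that the first two
summands on the right hand side of (2.2) together are no larger than $2\varepsilon$. For $k \in \{1, \ldots, \delta\}$,
set

\[
\mu_k^{(v)}(u) := \sum_{v_1 = \ell_1}^{N} \cdots \sum_{v_{k-1} = \ell_{k-1}}^{N} p(v, v_1, \ldots, v_{k-1}, u),
\]

and set $\mu_0^{(v)}(u) = \mathbf{1}\{v = u\}$. To rephrase the truncated moment estimate in terms of $\mu$, note
that $p$ is symmetric so that, for all $n \le 2\delta$ and $n^* := \lfloor n/2 \rfloor$,

\[
\sum_{\substack{(v_0, \ldots, v_n) \\ \text{admissible}}} \mathbb{P}\{v_0 \leftrightarrow \cdots \leftrightarrow v_n\}
\le \sum_{v_1 = \ell_1}^{N} \cdots \sum_{v_{n^*} = \ell_{n^*}}^{N} \cdots \sum_{v_{n-1} = \ell_1}^{N} p(v, \ldots, v_{n^*})\, p(v_{n^*}, \ldots, w)
= \sum_{v_{n^*} = \ell_{n^*}}^{N} \mu_{n^*}^{(v)}(v_{n^*})\, \mu_{n - n^*}^{(w)}(v_{n^*}).
\tag{2.4}
\]

Using the recursive representation

\[ \mu_{k+1}^{(v)}(n) = \sum_{m = \ell_k}^{N} \mu_k^{(v)}(m)\, p(m, n), \]

we establish upper bounds for $\mu_k^{(v)}(u)$, and use these to show that the rightmost term in (2.2)
remains small if $\delta$ is chosen sufficiently small. Using the input from Assumptions PA($\gamma$),
resp. CM($\gamma$), this will lead to the lower bounds for the typical distance in both theorems.
Detailed proofs will be given in Section 4.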
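
The recursive representation of $\mu_k^{(v)}$ is straightforward to evaluate numerically for small $N$. The sketch below does this for the preferential attachment weight; the helper names `pa_weight` and `truncated_mu`, the default $\kappa = 1$, and the thresholds used in the example are illustrative assumptions and not the sequence $(\ell_k)$ constructed in the proofs.

```python
def pa_weight(m, n, gamma, kappa=1.0):
    """Edge weight p(m, n) under Assumption PA(gamma)."""
    return kappa * min(m, n) ** (-gamma) * max(m, n) ** (gamma - 1.0)


def truncated_mu(v, thresholds, n_vertices, gamma, kappa=1.0):
    """Return [mu_0, mu_1, ..., mu_K] where mu_k[u - 1] stands for mu_k^{(v)}(u).

    thresholds[k] plays the role of ell_k, and the update is
    mu_{k+1}(n) = sum over m >= ell_k of mu_k(m) * p(m, n).
    """
    mu = [1.0 if u == v else 0.0 for u in range(1, n_vertices + 1)]
    history = [mu]
    for ell in thresholds:
        nxt = []
        for n in range(1, n_vertices + 1):
            total = 0.0
            for m in range(ell, n_vertices + 1):   # vertices below ell are cut off
                total += mu[m - 1] * pa_weight(m, n, gamma, kappa)
            nxt.append(total)
        mu = nxt
        history.append(mu)
    return history


if __name__ == "__main__":
    hist = truncated_mu(v=40, thresholds=[30, 10], n_vertices=50, gamma=0.75)
    print(hist[1][4])   # mu_1(5), which equals p(40, 5) because 40 >= ell_0
    print(hist[2][0])   # mu_2(1)
```

The double loop costs of order $N^2$ operations per step, which is fine for experimentation but is not meant as an efficient implementation.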

3. Examples 

In this section we give five examples, corresponding to the best understood models of 
ultrasmall networks in the mathematical literature. Examples 1-2 are of preferential attachment
type and will be discussed using our main result, Theorem 1, while Examples 3-5 are
of configuration type and will be discussed using Theorem 2.

Example 1 (Preferential attachment with fixed outdegree). This class of models is studied
in the work of Hooghiemstra, van der Hofstad and coauthors. We base our discussion on
the paper [DHH10], where three qualitatively similar models are considered, see also [Hof10]
for a survey. We focus on the first model studied in [DHH10], which is most convenient
to define; the two variants can be treated with the same method. The model depends on
two parameters, an integer $m \ge 1$ and a real $\delta > -m$. Roughly speaking, in every step a
new vertex is added to the network and connected to $m$ existing vertices with a probability
proportional to their degree plus $\delta$. Note that in the case $m = 1$ the network has the metric
structure of a tree, making this a degenerate case of less interest. The case famously studied
by Bollobás and Riordan [BR04] corresponds to $\delta = 0$ and $m \ge 2$ and leads to a network
with $\tau = 3$ and typical distance $\log N / \log\log N$, so that it lies outside the class of ultrasmall
networks.

We first generate a dynamic network model $(\mathcal{G}_N)$ for the case $m = 1$. By $Z[n, N]$, $n \le N$,
we denote the degree of vertex $n$ in $\mathcal{G}_N$ (with the convention that self-loops add two towards
the degree of the vertex to which they are attached).

• $\mathcal{G}_1$ consists of a single vertex, labelled 1, with one self-loop.

• In each further step, given $\mathcal{G}_N$, we insert one new vertex, labelled $N + 1$, and one
new edge into the network such that the new edge connects the new vertex to vertex
$m \in [N]$ with probability

\[ \mathbb{P}\{m \leftrightarrow N + 1 \mid \mathcal{G}_N\} = \frac{Z[m, N] + \delta}{N(2 + \delta) + 1 + \delta}, \]

or to itself with probability

\[ \frac{1 + \delta}{N(2 + \delta) + 1 + \delta}. \]

To generalise the model to arbitrary values of $m$, we take the graph $\mathcal{G}'_{mN}$ constructed using
parameters $m' = 1$ and $\delta' = \delta/m$, and merge vertices $m(k - 1) + 1, \ldots, mk$ in the graph $\mathcal{G}'_{mN}$
into a single vertex denoted $k$, keeping all edges. We obtain asymptotic degree distributions
which are power laws with exponent $\tau = 3 + \frac{\delta}{m}$, so that we expect to be in the ultrasmall
range if and only if $-m < \delta < 0$.
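
The construction above translates directly into a simulation. The following sketch generates the $m = 1$ model by inverting the cumulative attachment weights and then merges blocks of $m$ consecutive vertices; the function names `simulate_pa_m1` and `merge_blocks`, the edge-list representation, and the linear scan over vertices are choices made for this illustration.

```python
import random


def simulate_pa_m1(n_vertices, delta, rng):
    """Simulate the m = 1 model; returns (edges, degree), vertices labelled from 1."""
    if delta <= -1.0:
        raise ValueError("the m = 1 model needs delta > -1")
    degree = [0, 2]      # degree[0] is unused; vertex 1 carries one self-loop
    edges = [(1, 1)]
    for n in range(1, n_vertices):           # the current graph has n vertices
        norm = n * (2.0 + delta) + 1.0 + delta
        u = rng.random() * norm
        target, acc = n + 1, 0.0             # default outcome: self-loop at n + 1
        for m in range(1, n + 1):
            acc += degree[m] + delta         # weight Z[m, n] + delta of vertex m
            if u < acc:
                target = m
                break
        if target == n + 1:
            degree.append(2)                 # a self-loop counts twice
        else:
            degree.append(1)
            degree[target] += 1
        edges.append((n + 1, target))
    return edges, degree


def merge_blocks(edges, m):
    """Collapse vertices m(k-1)+1, ..., mk into vertex k, keeping all edges."""
    return [((a - 1) // m + 1, (b - 1) // m + 1) for a, b in edges]


if __name__ == "__main__":
    m, delta, n = 2, -1.0, 1000              # illustrative parameters, -m < delta < 0
    tree_edges, _ = simulate_pa_m1(m * n, delta / m, random.Random(0))
    graph_edges = merge_blocks(tree_edges, m)
    print(len(graph_edges), "edges on", n, "vertices")
```

The weights $Z[m, n] + \delta$ of the existing vertices sum to $n(2 + \delta)$, so the residual mass $1 + \delta$ is exactly the self-loop probability after normalisation.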

Proposition 3. For independent, uniformly chosen vertices $V$ and $W$ in the giant component
of the preferential attachment model with parameters $m \ge 2$ and $-m < \delta < 0$, we have

\[ d_N(V, W) = (4 + o(1)) \frac{\log\log N}{-\log\big(1 + \frac{\delta}{m}\big)} \quad \text{with high probability.} \]

Remark 1. The upper bound is proved in [DHH10], see the remark following Theorem 1.6.
This paper leaves the problem of finding a lower bound open. We resolve this problem by
verifying Assumption PA($\gamma$) for $\gamma = (2 + \frac{\delta}{m})^{-1}$ and applying Theorem 1.

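Proof. We look at $m = 1$ first. In this case, we have, for $1 \le m < n \le N$,

\[ \mathbb{P}\{m \leftrightarrow n\} = \frac{\mathbb{E} Z[m, n - 1] + \delta}{n(2 + \delta) - 1}. \tag{3.1} \]

It is easy to see that

\[ \mathbb{E}\big[ Z[m, n] + \delta \,\big|\, Z[m, n - 1] \big] = \big( Z[m, n - 1] + \delta \big)\, \frac{n(2 + \delta)}{n(2 + \delta) - 1}, \]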
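and hence

\[
\mathbb{E}\big[ Z[m, n] + \delta \big]
= (1 + \delta)\, \frac{\Gamma(n + 1)\, \Gamma\big(m - \frac{1}{2 + \delta}\big)}{\Gamma\big(n + \frac{1 + \delta}{2 + \delta}\big)\, \Gamma(m)}.
\]

In particular there exist constants $0 < c < C$ such that

\[ c \Big( \frac{n}{m} \Big)^{\frac{1}{2 + \delta}} \le \mathbb{E} Z[m, n] \le C \Big( \frac{n}{m} \Big)^{\frac{1}{2 + \delta}} \quad \text{for all } 1 \le m \le n. \]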
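
The Gamma-function expression can be checked numerically against the claimed power-law growth. The sketch below evaluates the closed form through `math.lgamma` to avoid overflow; the function name `expected_degree_plus_delta` and the parameter values in the example are assumptions made for illustration.

```python
import math


def expected_degree_plus_delta(m, n, delta):
    """Evaluate (1 + delta) Gamma(n+1) Gamma(m - a) / (Gamma(n + 1 - a) Gamma(m)), a = 1/(2 + delta)."""
    a = 1.0 / (2.0 + delta)
    # n + (1 + delta)/(2 + delta) equals n + 1 - a
    log_ratio = (math.lgamma(n + 1) + math.lgamma(m - a)
                 - math.lgamma(n + 1 - a) - math.lgamma(m))
    return (1.0 + delta) * math.exp(log_ratio)


if __name__ == "__main__":
    delta = -0.5
    a = 1.0 / (2.0 + delta)
    for m, n in [(1, 10), (10, 10**5), (500, 10**6)]:
        ratio = expected_degree_plus_delta(m, n, delta) / (n / m) ** a
        print(m, n, ratio)   # stays within fixed positive bounds
```

Consecutive values of the closed form differ by the factor $n(2 + \delta) / (n(2 + \delta) - 1)$, in agreement with the one-step conditional expectation.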
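Combining this with (3.1) yields, for $\gamma = \frac{1}{2 + \delta}$ and a suitable $\kappa_1 > 0$, that

\[
\mathbb{P}\{m \leftrightarrow n\} \le \frac{C \big( \frac{n - 1}{m} \big)^{\gamma} + \delta}{n(2 + \delta) - 1} \le \kappa_1\, n^{\gamma - 1} m^{-\gamma} \quad \text{for all } 1 \le m < n. \tag{3.2}
\]

To verify PA($\gamma$), following [DHH10, Lemma 2.1] we find that for distinct vertices $v_0, \ldots, v_\ell$
all events of the form $\{v_{j-1} \leftrightarrow v_j \leftrightarrow v_{j+1}\}$ with $j \in \{1, \ldots, \ell - 1\}$ and $v_j < v_{j-1}, v_{j+1}$, and
all events $\{v_{j-1} \leftrightarrow v_j\}$ which are not part of these, are nonpositively correlated, in the sense
that the probability of all of them occurring is smaller than the product of the probabilities.
Recalling also (3.2) it remains to show that for $m < v, w$,

\[ \mathbb{P}\{v \leftrightarrow m \leftrightarrow w\} \le \kappa_2\, v^{\gamma - 1} w^{\gamma - 1} m^{-2\gamma}, \tag{3.3} \]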

for some finite constant K2 > 0. To this end we let {{Zn'"^^)n>m - /c,m G N} denote the col- 
lection of right-continuous Markov jump processes starting at Z!^'^^ = k, jumping instantly 
at time m and subsequently at integer time-steps following the rule 

in n I n J „(2 + (5) - (5 ^ n \ n > 

Note that {Z\m,n\)n>m = {Zn'"^^)n>m in law and that, for m < n, the event {m o n} 
corresponds to {AZn'""^ = 1}, where we write AZ^*"'™' := Zn'"^^ — Z^_^\ Note also that 
^0=0 >™) jg stochastically dominated by Zn'"^^ for k > kQ. Hence, for m < ni < n2, 

m~n2+2 ni+1 

E[z(f-'|Az(^-' = i] = E jnzi?r' = j\z':;-=k,Azi,'r' = '^} 

j=2 k=2 

X P{Z^'^'™' = k I AZ^";™) = 1} 



^ E E 

j=2 A:=2 



(ni(2 + 5) + 1 + 5) P{AZ4'r^ = 1} 



^2 {m{2 + 6) + l + 6)¥{AZi?r^ = 1}' 

As in the derivation of ()3.2p the expectation in the last line can be bounded from above by 
co{k + l)n2n^'^ , for some cq > 0. Similarly, we obtain F{AZni"^^ = 1} > cinj m^^ and 

for further constants ci,C2 > 0. Summarising, we obtain 

m— ni+l 

^[Z'^f^ I AZ'^'"' = 1] < c^nlnf''m'< ^ fc^PjZ*';!:' = A:} < C4 n^m"^, 

fc=2 

for some $c_3, c_4 > 0$, and this establishes (3.3). Finally, passing from $m = 1$ to general $m$ can
be achieved by a simple union bound. □ 

A different class of preferential attachment models was introduced in [DM09] and further
studied in [DM10]. Here a new vertex is connected to any existing vertex independently with
a probability depending (possibly nonlinearly) on its degree. In this model the number of
edges created in every step is asymptotically Poisson distributed.

Example 2 (Preferential attachment with variable outdegree). This model is studied in the
work of Dereich, Mörters and coauthors, see [DM11] for a survey. The model depends on
a concave function $f \colon \mathbb{N} \cup \{0\} \to (0, \infty)$, which is called the attachment rule. Roughly
speaking, in every step a new vertex is added to the network and oriented edges from the
new vertex to existing vertices are introduced independently with a probability proportional
to the value of $f$ at the current degree of the existing vertex.

More precisely, to generate a dynamic network model $(\mathcal{G}_N)$ we assume that $f$ satisfies
$f(0) \le 1$ and $f(1) - f(0) < 1$. An important parameter derived from $f$ is the limit

\[ \gamma := \lim_{n \to \infty} \frac{f(n)}{n}, \]

which always exists with $0 \le \gamma < 1$, by concavity. By $Z[n, N]$, $n \le N$, we denote the number
of younger vertices to which vertex $n$ is connected in $\mathcal{G}_N$.

• $\mathcal{G}_1$ consists of a single vertex, labelled 1, and no edges.

• In the $(N + 1)$st step, given $\mathcal{G}_N$, we insert one new vertex, labelled $N + 1$, and
independently for any $m \in [N]$ we introduce an edge from $N + 1$ to $m$ with probability

\[ \frac{f(Z[m, N])}{N}. \]
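
A direct simulation of this model is short. In the sketch below the function name `simulate_variable_outdegree`, the edge-list representation, and the affine attachment rule used in the example are illustrative assumptions; any concave $f$ with $f(0) \le 1$ and $f(1) - f(0) < 1$ could be passed instead.

```python
import random


def simulate_variable_outdegree(n_vertices, f, rng):
    """Return (edges, indegree); indegree[m] is the number of younger neighbours of m."""
    indegree = [0] * (n_vertices + 1)        # index 0 is unused
    edges = []
    for n in range(1, n_vertices):           # add vertex n + 1 to the graph on [n]
        new = n + 1
        # All decisions in one step use the degrees of the current graph.
        hits = [m for m in range(1, n + 1) if rng.random() < f(indegree[m]) / n]
        for m in hits:
            indegree[m] += 1
            edges.append((new, m))
    return edges, indegree


if __name__ == "__main__":
    # Affine (hence concave) rule with limit gamma = 0.75, chosen for illustration.
    edges, indegree = simulate_variable_outdegree(
        2000, lambda k: 0.75 * k + 0.5, random.Random(1)
    )
    print(len(edges), "edges; largest indegree:", max(indegree))
```

The two conditions on $f$ together with concavity give $f(k) < k + 1$, and since $Z[m, N] \le N - m$ the ratio $f(Z[m, N])/N$ is a genuine probability.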

By [DM09, Theorem 1.1(b)] the conditional distribution given $\mathcal{G}_N$ of the number of edges
created in the $(N + 1)$st step converges to a Poisson distribution and the empirical distribution
of the degrees converges to a power law with exponent $\tau = 1 + \frac{1}{\gamma}$, or more precisely to a
random probability vector $(\mu_k)$ satisfying

\[ \lim_{k \to \infty} -\frac{\log \mu_k}{\log k} = 1 + \frac{1}{\gamma}. \]

We therefore expect the network to be ultrasmall if and only if $\gamma > \frac12$.

Proposition 4. For independent, uniformly chosen vertices $V$ and $W$ in the giant component
of the preferential attachment model with attachment rule $f$ and derived parameter $\gamma > \frac12$,
we have

\[ d_N(V, W) = (4 + o(1)) \frac{\log\log N}{\log\big(\frac{\gamma}{1 - \gamma}\big)} \quad \text{with high probability.} \]

Remark 2. The upper bound can be proved by adapting the argument of [DHH10], see the
forthcoming thesis [Mon12] for details. For the lower bound we verify Assumption PA($\gamma + \varepsilon$),
for any $\varepsilon > 0$, and apply Theorem 1.

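Proof. We first note that, for $v < w \in [N]$,

\[ \mathbb{P}\{v \leftrightarrow w\} = \frac{\mathbb{E} f(Z[v, w - 1])}{w - 1}. \]

To estimate the expectation we note that by concavity, given $\varepsilon > 0$ there exists $k$ such that,
for all $n \ge k$, we have $f(n) \le f(k) + (\gamma + \varepsilon)(n - k)$. An easy calculation (see [DM10,
Lemma 2.7]) shows that

\[ \mathbb{E} f(Z[v, w - 1]) \le C_1\, w^{\gamma + \varepsilon} v^{-\gamma - \varepsilon} \quad \text{for a suitable constant } C_1 > 0. \tag{3.4} \]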

We now use (3.4) to verify PA($\gamma + \varepsilon$). For $v < w \in [N]$, all events $\{v \leftrightarrow w\}$ with different
values of $v$ are independent. Hence $\mathbb{P}\{v_0 \leftrightarrow \cdots \leftrightarrow v_\ell\}$ can be decomposed into factors of
the form $\mathbb{P}\{v_{j-1} \leftrightarrow v_j \leftrightarrow v_{j+1}\}$ with $v_j < v_{j-1}, v_{j+1}$ and factors of the form $\mathbb{P}\{v_{j-1} \leftrightarrow v_j\}$
for the remaining edges. The latter are controlled by (3.4), so it remains to estimate factors
of the former form. We may assume $v < u < w$ and get

[u — l){w — 1) 

Arguing as in the derivation of (3.4) we get, for a suitable constant $C_2 > 0$,

\[ \mathbb{E}\big[ f(Z[v, w - 1]) \,\big|\, Z[v, u - 1] = k \big] \le C_2\, f(k)\, w^{\gamma + \varepsilon} u^{-\gamma - \varepsilon}. \]

Hence 

E[/(Z[t;,n - - 1])] < E[/(Z[^;, n - l])>^+^n-^-^ 

and, using a similar argument as above, we obtain C3 > such that 

21 ^ r'„„27+£y-27-£ 



Summarising, we obtain a constant C4 > such that 

as required to complete the proof. □ 

We now give three examples of random networks in the universality class of configuration 
models. The first two belong to the wide class of inhomogeneous random graphs, whose 
essential feature is the independence between different edges. 

Example 3 (Expected degree random graph). This model is studied in the work of Chung and
Lu, see [CL03] or [CL06] for a survey. In its general form the model depends on a triangular
scheme $w_1^{(N)}, \ldots, w_N^{(N)}$ of positive weights, where the weight $w_i^{(N)}$ plays the role of the expected
degree of vertex $i$ in $\mathcal{G}_N$. The model is defined by the following two requirements:

• for every pair $(i, j)$ with $1 \le i \ne j \le N$ the events $\{i \leftrightarrow j\}$ are independent,

• for every pair $(i, j)$ with $1 \le i \ne j \le N$ we have

\[ \mathbb{P}\{i \leftrightarrow j\} = \frac{w_i^{(N)} w_j^{(N)}}{L_N} \wedge 1, \quad \text{where } L_N := \sum_{l=1}^{N} w_l^{(N)}. \]
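
The two requirements above determine the model completely, so a sample can be drawn by an independent coin flip for every unordered pair of distinct vertices. The sketch below does exactly that; the function name `expected_degree_graph` and the weight sequence $w_i = (N/i)^{0.75}$ in the example are illustrative assumptions.

```python
import random


def expected_degree_graph(weights, rng):
    """Sample edges (i, j), i < j, each present independently with probability min(w_i w_j / L, 1)."""
    total = sum(weights)                     # L_N, assumed positive
    n = len(weights)
    edges = []
    for i in range(n):
        for j in range(i + 1, n):
            prob = min(weights[i] * weights[j] / total, 1.0)
            if rng.random() < prob:
                edges.append((i + 1, j + 1))  # vertices are labelled from 1
    return edges


if __name__ == "__main__":
    n = 500
    weights = [(n / i) ** 0.75 for i in range(1, n + 1)]
    edges = expected_degree_graph(weights, random.Random(2))
    print(len(edges), "edges")
```

The quadratic loop over pairs is the simplest faithful implementation; for large $N$ one would want a faster sampling scheme, which is not attempted here.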

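Proposition 5. For independent, uniformly chosen vertices $V$ and $W$ in the expected degree
random graph with weights satisfying

\[ c \Big( \frac{N}{i} \Big)^{\gamma} \le w_i^{(N)} \le C \Big( \frac{N}{i} \Big)^{\gamma} \quad \text{for all } 1 \le i \le N, \]

for some $\gamma > \frac12$ and constants $0 < c \le C$, we have

\[ d_N(V, W) = (2 + o(1)) \frac{\log\log N}{\log\big(\frac{\gamma}{1 - \gamma}\big)} \quad \text{with high probability.} \]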

Proof. The upper bound is sketched in [CL03]. For the lower bound we have to check
Assumption CM($\gamma$). Note that, using the upper bound on the weights,

\[ w_i^{(N)} w_j^{(N)} \le C^2 N^{2\gamma} i^{-\gamma} j^{-\gamma}. \]

From the lower bound on the weights we get that $L_N \ge cN$, for some $c > 0$, and hence
$\mathbb{P}\{i \leftrightarrow j\} \le \kappa N^{2\gamma - 1} i^{-\gamma} j^{-\gamma}$ for a suitable $\kappa$. Using the independence assumption we see that
Condition CM($\gamma$) holds, and the lower bound follows from Theorem 2. □

Example 4 (Conditionally Poissonian random graph). This model is studied in the work of
Norros and Reittu, see [NR06]. It is based on drawing an independent, identically distributed
sequence $\Lambda_1, \Lambda_2, \ldots$ of positive capacities. Conditional on this sequence, the dynamical
network model is constructed as follows:

• $\mathcal{G}_1$ consists of a single vertex, labelled 1, and no edges.

• In the $(N + 1)$st step, given $\mathcal{G}_N$, we insert one new vertex, labelled $N + 1$, and
independently for any $m \in [N]$ we introduce a random number of edges between
$N + 1$ and $m$ according to a Poisson distribution with parameter

\[ \frac{\Lambda_m \Lambda_{N+1}}{L_{N+1}} \quad \text{for } L_N := \sum_{k=1}^{N} \Lambda_k. \]

• We further remove each edge in $\mathcal{G}_N$ independently with probability $1 - L_N / L_{N+1}$,
and thus obtain $\mathcal{G}_{N+1}$.

Recall that having possibly several edges between two vertices has no relevance for the typical
distances in the giant component. In order to be in the ultrasmall regime we require the law
of the capacities to be a power law with exponent $2 < \tau < 3$.

Proposition 6. Assume that the capacities in the conditionally Poissonian random graph
satisfy

\[ \mathbb{P}\{\Lambda_1 > x\} = x^{1 - \tau} (c + o(1)) \quad \text{for all sufficiently large } x, \]

where $2 < \tau < 3$ and $c > 0$ is constant. For independent, uniformly chosen vertices $V$ and
$W$ in the giant component we have

\[ d_N(V, W) = (2 + o(1)) \frac{\log\log N}{-\log(\tau - 2)} \quad \text{with high probability.} \]

Remark 3. The upper bound is proved in [NR06, Theorem 4.2], where it is also shown that a
giant component exists. For the lower bound we verify Assumption CM($\gamma$) for $\gamma = 1/(\tau - 1)$
and apply Theorem 2.

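Proof. We check that Assumption CM($\gamma$) holds with high probability, conditionally given
the capacities. For fixed $N$ we put the capacities in decreasing order

\[ \Lambda_1^{(N)} \ge \Lambda_2^{(N)} \ge \cdots \ge \Lambda_N^{(N)} \]

and relabel the vertices so that the $j$th vertex has weight $\Lambda_j^{(N)}$. We recall from [NR06,
Proposition 2.1] that the number of edges between vertices $i$ and $j$ in $\mathcal{G}_N$ is Poisson distributed
with parameter $\Lambda_i^{(N)} \Lambda_j^{(N)} / L_N$. As the edges are conditionally independent we only have to
verify that, given $\varepsilon > 0$ there exists $\kappa > 0$ such that

\[ 1 - \exp\Big( -\frac{\Lambda_i^{(N)} \Lambda_j^{(N)}}{L_N} \Big) \le \kappa N^{2\gamma - 1} i^{-\gamma} j^{-\gamma} \quad \text{for all } 1 \le i < j \le N, \tag{3.5} \]

with probability $\ge 1 - 2\varepsilon$. By the law of large numbers $L_N$ is of order $N$, so that it suffices
to establish $\Lambda_i^{(N)} \le \kappa (N/i)^{\gamma}$ for all $1 \le i \le N$. To this end we denote by $S_i^{(N)}$ the number
of potential values exceeding $\kappa (N/i)^{\gamma}$. The random variable $S_i^{(N)}$ is binomially distributed
with parameters $N$ and $p := \mathbb{P}\{\Lambda_1 > \kappa (N/i)^{\gamma}\} \le c(\kappa) \frac{i}{N}$ where $c(x) \downarrow 0$ for $x \uparrow \infty$. By
Bernstein's inequality, see e.g. [Ben62, (8)],

\[ \mathbb{P}\{S_i^{(N)} \ge 2i\} \le \exp\Big( -\frac{i^2 / 2}{\mathrm{Var}(S_i^{(N)}) + i/3} \Big) \le e^{-\frac{3}{8} i} \quad \text{if } c(\kappa) \le 1. \]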



N . 

Hence we may choose M large enough so that X^i^A^ ^^P(~i^) < ^' ensuring that with 
probability exceeding 1 — e we have A%''> < k {N/iy< for aU i>M.\i remains to give bounds 
on the largest capacities. By a standard Poisson approximation result, see e.g. [Res08, Proposition
3.21], we note that for any 1 < i < 2M, we have that S^^ converges weakly to a Poisson 
distribution with parameter A := limAr^oo -^lP{Ai > K{N/i)'^} < 2c{k)M, and hence, by 
choosing k large, we can ensure that for large N, we have J2l=i > ^} < e, which 

completes the proof. □ 

Another family in the universality class of configuration models is that of random
networks with fixed degree sequence.$^1$ This model is well studied and very detailed results
on average distances in the case of power laws with exponent $\tau \in (2, 3)$ are obtained, in
particular by van der Hofstad et al. in [HHZ07].

Example 5 (Random networks with fixed degree sequence). The idea behind this class of
models is to enforce a particular power-law exponent by fixing the degree sequence of the
network in a first step. We therefore choose a sequence $D_1, D_2, \ldots$ of independent and
identically distributed random variables with values in the nonnegative integers. For given
$N$ we assume that

\[ L_N := \sum_{i=1}^{N} D_i \]

is even, which may be achieved by replacing $D_N$ by $D_N - 1$ if necessary. Thus given
$D_1, \ldots, D_N$ we construct the network $\mathcal{G}_N$ as follows:

• To any vertex $m \in [N]$ we attach $D_m$ half-edges or stubs.

• The $L_N$ stubs are given an (arbitrary) order.

• We start by pairing the first stub with a (uniformly) randomly chosen other stub,
and continue pairing the lowest numbered unpaired stub with a remaining randomly
chosen stub until all stubs are matched.

• Any pair of stubs is connected to form an edge.
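
The stub pairing can be sketched in a few lines. Note that the code below does not follow the sequential procedure literally: it shuffles the list of stubs and pairs consecutive entries, which also produces a uniformly random perfect matching of the stubs. The function name `pair_stubs`, the choice to raise an error on an odd stub count instead of decreasing the last degree, and the example degree sequence are assumptions made here.

```python
import random


def pair_stubs(degrees, rng):
    """Return the edges of a uniformly random pairing of stubs; vertices are labelled from 1."""
    stubs = [v for v, d in enumerate(degrees, start=1) for _ in range(d)]
    if len(stubs) % 2:
        raise ValueError("the total number of stubs must be even")
    rng.shuffle(stubs)                       # uniform random order of all stubs
    # Consecutive stubs in the shuffled order are joined into edges.
    return [(stubs[i], stubs[i + 1]) for i in range(0, len(stubs), 2)]


if __name__ == "__main__":
    edges = pair_stubs([3, 2, 2, 1], random.Random(3))
    print(edges)   # may contain self-loops and multiple edges
```

Self-loops appear as pairs `(v, v)` and repeated pairs correspond to multiple edges; both can be discarded when only graph distances are of interest.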

Obviously the resulting network can have self-loops and double edges, but this has no relevance
for the typical distances in the giant component. In order to be in the ultrasmall regime we
require the law of the degrees to be a power law with exponent $2 < \tau < 3$.

Proposition 7. Assume that there exists $c > 0$ such that

\[ \mathbb{P}\{D_1 > x\} = x^{1 - \tau} (c + o(1)) \quad \text{for all sufficiently large } x. \]

For independent, uniformly chosen vertices $V$ and $W$ in the giant component we have

\[ d_N(V, W) = (2 + o(1)) \frac{\log\log N}{-\log(\tau - 2)} \quad \text{with high probability.} \]

Remark 4. This and much more is proved in [HHZ07, Theorem 1.2]. For an alternative
approach to the lower bound we now verify Assumption CM($\gamma$) for any $\gamma > 1/(\tau - 1)$ and
paths of length up to $\ell = O(\log\log N)$, which is clearly sufficient to apply Theorem 2.

Proof. We observe that, given $D_1, \ldots, D_N$, for pairwise distinct vertices $v_1, \ldots, v_\ell, v_{\ell+1}$,

\[
\mathbb{P}\{v_\ell \leftrightarrow v_{\ell+1} \mid v_1 \leftrightarrow v_2 \leftrightarrow \cdots \leftrightarrow v_\ell\}
\le \frac{D_{v_\ell} D_{v_{\ell+1}}}{L_N - 2 \sum_{k=1}^{\ell} D_{v_k}},
\]

where the denominator is a rough lower bound on the number of stubs unaffected by the
conditioning event. In particular, $\mathbb{P}\{i \leftrightarrow j\} \le \frac{D_i D_j}{L_N - 2 D_i}$. Using the law of large numbers one
can easily see that there is a $c > 0$ such that

\[ L_N - 2 \sum_{k=1}^{\ell} D_{v_k} \ge cN \quad \text{with high probability,} \]

for any choice of $v_1, \ldots, v_\ell$, if $\ell = O(\log\log N)$. Therefore, to verify Assumption CM($\gamma$) we
only need to find appropriate bounds on the degrees of given vertices, which can be achieved
(using the same relabeling) by a similar argument as in Example 4. □

$^1$In fact, in the literature these models are often called configuration models. We prefer to use the term for
the wider class of models where vertices are equipped with an a-priori configuration of individual features.

4. Proofs 

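4.1. Proof of Theorem 1. In this section, we assume validity of Assumption PA($\gamma$) for a
$\gamma \in (\frac12, 1)$ with a fixed constant $\kappa$. Given a vector $(q(1), \ldots, q(n))$ we use the notation

\[ q[m] := \sum_{i=1}^{m} q(i) \quad \text{for all } 1 \le m \le n. \]

We adopt the notation of the discussion at the end of Section 2. In particular recall the
definition of $\mu_k^{(v)}$ and the key estimates (2.2), (2.3) and (2.4), which combined give

\[
\mathbb{P}\{d_N(v, w) \le 2\delta\}
\le \sum_{k=1}^{\delta} \mu_k^{(v)}[\ell_k - 1] + \sum_{k=1}^{\delta} \mu_k^{(w)}[\ell_k - 1]
+ \sum_{n=1}^{2\delta} \sum_{u = \ell_{n^*}}^{N} \mu_{n^*}^{(v)}(u)\, \mu_{n - n^*}^{(w)}(u).
\tag{4.1}
\]

The remaining task of the proof is to choose $\delta \in \mathbb{N}$ and $2 \le \ell_\delta \le \cdots \le \ell_0 \le N$ which allow
the required estimates for the right hand side. To do so we will make use of the recursive
representation

\[ \mu_{k+1}^{(v)}(n) = \sum_{m = \ell_k}^{N} \mu_k^{(v)}(m)\, p(m, n) \quad \text{for } k \in \{0, \ldots, \delta - 1\} \text{ and } n \in [N], \]

where $\mu_0^{(v)}(n) = \mathbf{1}\{v = n\}$ and

\[ p(m, n) = \kappa\, (m \wedge n)^{-\gamma} (m \vee n)^{\gamma - 1}. \]

Denote by $\bar\mu_k^{(v)}(m) = \mathbf{1}\{m \ge \ell_k\}\, \mu_k^{(v)}(m)$ the truncated version of $\mu_k^{(v)}$ and conceive $\mu_k^{(v)}$ and $\bar\mu_k^{(v)}$
as row vectors. Then

\[ \mu_{k+1}^{(v)} = \bar\mu_k^{(v)} P_N, \tag{4.2} \]

where $P_N = (p(m, n))_{m, n = 1, \ldots, N}$. Our aim is to provide a majorant of the form

\[ \mu_k^{(v)}(m) \le \alpha_k m^{-\gamma} + \mathbf{1}\{m > \ell_{k-1}\}\, \beta_k m^{\gamma - 1} \tag{4.3} \]

for suitably chosen parameters $\alpha_k, \beta_k \ge 0$. Key to this choice is the following lemma.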
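Lemma 8. Suppose that $2 \le \ell \le N$, $\alpha, \beta \ge 0$ and $q \colon [N] \to [0, \infty)$ satisfies

\[ q(m) \le \mathbf{1}\{m \ge \ell\} \big( \alpha m^{-\gamma} + \beta m^{\gamma - 1} \big) \quad \text{for all } m \in [N]. \]

Then there exists a constant $c > 1$ (depending only on $\gamma$ and $\kappa$) such that

\[
q P_N(m) \le c \Big( \alpha \log\Big( \frac{N}{\ell - 1} \Big) + \beta N^{2\gamma - 1} \Big) m^{-\gamma}
+ \mathbf{1}\{m > \ell\}\, c \Big( \alpha \ell^{1 - 2\gamma} + \beta \log\Big( \frac{m}{\ell - 1} \Big) \Big) m^{\gamma - 1}
\]

for all $m \in [N]$.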


TYPICAL DISTANCES IN ULTRASMALL PREFERENTIAL ATTACHMENT GRAPHS 13 
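Proof. One has

\[
\begin{aligned}
q P_N(m)
&= \mathbf{1}\{m > \ell\} \sum_{k=\ell}^{m-1} q(k)\, p(k, m) + \sum_{k = m \vee \ell}^{N} q(k)\, p(k, m) \\
&\le \mathbf{1}\{m > \ell\} \sum_{k=\ell}^{m-1} \kappa \big( \alpha k^{-\gamma} + \beta k^{\gamma - 1} \big) k^{-\gamma} m^{\gamma - 1}
 + \sum_{k = m \vee \ell}^{N} \kappa \big( \alpha k^{-\gamma} + \beta k^{\gamma - 1} \big) k^{\gamma - 1} m^{-\gamma} \\
&\le \kappa \Big( \alpha \sum_{k = m \vee \ell}^{N} k^{-1} + \beta \sum_{k = m \vee \ell}^{N} k^{2\gamma - 2} \Big) m^{-\gamma}
 + \mathbf{1}\{m > \ell\}\, \kappa \Big( \alpha \sum_{k=\ell}^{m-1} k^{-2\gamma} + \beta \sum_{k=\ell}^{m-1} k^{-1} \Big) m^{\gamma - 1} \\
&\le \kappa \Big( \alpha \log\Big( \frac{N}{\ell - 1} \Big) + \frac{\beta}{2\gamma - 1} N^{2\gamma - 1} \Big) m^{-\gamma}
 + \mathbf{1}\{m > \ell\}\, \kappa \Big( \frac{\alpha}{2\gamma - 1} (\ell - 1)^{1 - 2\gamma} + \beta \log\Big( \frac{m}{\ell - 1} \Big) \Big) m^{\gamma - 1}.
\end{aligned}
\]

This implies immediately the assertion since $\ell \ge 2$ by assumption. □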
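We apply Lemma 8 iteratively. Fix $\varepsilon > 0$ small and start with

\[ \ell_0 = \lceil \varepsilon N \rceil, \quad \alpha_1 = \kappa (\varepsilon N)^{\gamma - 1} \quad \text{and} \quad \beta_1 = \kappa (\varepsilon N)^{-\gamma}. \]

Fix $v \ge \ell_0$. Then, for all $m \in [N]$,

\[
\mu_1^{(v)}(m) = p(v, m) \le \kappa\, \ell_0^{\gamma - 1} m^{-\gamma} + \mathbf{1}\{m > \ell_0\}\, \kappa\, \ell_0^{-\gamma} m^{\gamma - 1}
\le \alpha_1 m^{-\gamma} + \mathbf{1}\{m > \ell_0\}\, \beta_1 m^{\gamma - 1}.
\]

Now suppose, for some $k \in \mathbb{N}$, we have chosen $\alpha_k, \beta_k$ and an integer $\ell_{k-1}$ such that

\[ \mu_k^{(v)}(m) \le \alpha_k m^{-\gamma} + \beta_k m^{\gamma - 1} \quad \text{for all } m \in [N]. \]

We choose $\ell_k$ as an integer satisfying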

>A:«4"^ (4.4) 



and assume ^ 2. Pick a^^Pk such that 

afe+i >c(afe log (g) +/3fciV27-i), 

>c(Q4~'^ + /3fclog(f 



(4.5) 



By the induction hypothesis we can apply Lemma 8 with $\ell = \ell_k$ and $q(m) = \bar\mu_k^{(v)}(m)$. Then,
using (4.2),

\[ \mu_{k+1}^{(v)}(m) \le \alpha_{k+1} m^{-\gamma} + \mathbf{1}\{m > \ell_k\}\, \beta_{k+1} m^{\gamma - 1} \quad \text{for all } m \in [N], \tag{4.6} \]

showing that the induction can be carried forward up to the point where $\ell_k < 2$.
Summing over ()4.6p and using (14. 4p we obtain 
Hence the first two summands on the right hand side in (4.1) are together smaller than $2\varepsilon$.
It remains to choose $\delta = \delta(N)$ as large as possible while ensuring that $\ell_\delta \ge 2$ and

\[ \lim_{N \to \infty} \sum_{n=1}^{2\delta} \sum_{u = \ell_{n^*}}^{N} \mu_{n^*}^{(v)}(u)\, \mu_{n - n^*}^{(w)}(u) = 0. \]

To this end assume that $\ell_k$ is the largest integer satisfying (4.4) and the parameters $\alpha_k, \beta_k$
are defined via equalities in (4.5). To establish lower bounds for the decay of $\ell_k$ we investigate
the growth of $\eta_k := N / \ell_k > 0$. Going backwards through the definitions yields, for $k \ge 1$,
that

1^+2+ AT j ^ P ^fc + iOgr/fe+i, 

with $\eta_1, \eta_2 \le C_0$ for some constant $C_0 > 0$ (which, as all constants in this paragraph, may
depend on $\varepsilon$). It is easy to check inductively that for any solution of this system there exist
constants $b, B > 0$ such that,

\[ \eta_k \le b \exp\Big( B \Big( \frac{\gamma}{1 - \gamma} \Big)^{k/2} \Big). \tag{4.7} \]

We now use (4.6) to estimate



25 AT S N 

n=l u=tj^ k=l u=£k 

5 



k=l 

Using (j4.4p and (j4.7p the first summand in the bracket can be estimated by 

ali]-'-' < (r^fld - 7))^^,-^ < (^(1 - 7))^ ^ exp {b [^Y'^ 

Using equality in (|4.5p we get (is < c{asl\~'^'^ + asN^^'^'^ log{N/£s))- Noting that the second 
summand on the right hand side is bounded by a multiple of the first, we find a constant 
Ci > such that /JfiY^'^'^i < Cia|£j~^'^, and thus, for a suitable constant C2 > 0, 

25 N 



E E f^n*i^)^^n-n*i^) < C2 exp [b 



,5/2 

n=l u=lk 

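Hence, for a suitable constant $C > 0$, choosing

\[ \delta \le 2\, \frac{\log\log N}{\log\big( \frac{\gamma}{1 - \gamma} \big)} - C \]

we obtain that the term we consider goes to zero of order $O((\log\log N)^{-3})$. Note from (4.7)
that this choice also ensures that $\ell_\delta \ge 2$. We have thus shown that

\[ \mathbb{P}\{d_N(v, w) \le 2\delta\} \le 2\varepsilon + O\big( (\log\log N)^{-3} \big), \]

whenever $v, w \ge \ell_0 = \lceil \varepsilon N \rceil$, which implies the statement of Theorem 1.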

4.2. Proof of Theorem 2. In this section, we assume validity of Assumption CM($\gamma$) for
some $\gamma \in (\frac12, 1)$ with a fixed constant $\kappa > 1$. Recall again the notation and framework from
the introductory chapter. We use the same approach as in the proof of Theorem 1 but now
we have to consider the matrix $P_N := (p(m, n))_{m, n \in [N]}$ given by

\[ p(m, n) := \kappa\, m^{-\gamma} n^{-\gamma} N^{2\gamma - 1} \quad \text{for } m, n \in [N]. \tag{4.8} \]

We obtain the following lemma, which is the analogue of Lemma 8.

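Lemma 9. Suppose that $2 \le \ell \le N$ and $q \colon [N] \to [0, \infty)$ satisfies

\[ q(m) \le \mathbf{1}\{m \ge \ell\}\, m^{-\gamma} N^{\gamma - 1} \quad \text{for all } m \in [N]. \]

Then, for all $m \in [N]$,

\[ q P_N(m) \le \kappa\, m^{-\gamma} N^{\gamma - 1} \Big( \frac{N}{\ell} \Big)^{2\gamma - 1} \log\Big( \frac{N}{\ell - 1} \Big). \]

Proof. By (4.8) and the assumption on $q$,

\[
q P_N(m) = \sum_{i=1}^{N} q(i)\, p(i, m)
\le \kappa\, m^{-\gamma} N^{3\gamma - 2} \sum_{i=\ell}^{N} i^{-2\gamma}
\le \kappa\, m^{-\gamma} N^{3\gamma - 2}\, \ell^{1 - 2\gamma} \log\Big( \frac{N}{\ell - 1} \Big),
\]

which implies the statement of the lemma. □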

For fixed e > we first construct inductively a strictly decreasing sequence of integers 
{£k)k=o,...,6 by letting £o = [£-^1 and defining l^+i as the largest integer such that, given 

Recall the definition and recursive formula for and let fi^^\m) := l{m > 
Then i^i^l ^(m) — /i[."''Pjv(m). We now apply inductively Lemma[9]and obtain, 

/Ufc'(m) < iim-^N^-^(^-^y\og (^£7^) < ^h m € [N]. (4.10) 

Note that the second inequality in (I4.10p follows from (14. 9p . and hence Ji^^\m) < m'^^^^^'^, 
which allows us to continue the induction. Considering the truncated first moment esti- 
mate ()2.2p for our choice of (^fc)fc=o,...,(5) we obtain from (j4.10p that 



Hence (|4.9p entails that X]A-=i^(^fcO — ^- '^^^ ^*^P choose 5 = 5{N) as large as 
possible while ensuring that ^5 > 2 and 

25 N 

hm l^^l^-n*^ = 0- (4.11) 

71=1 U=l„* 

By (|4.10p the term on the left can be bounded by a constant multiple of N'^^ Xyfc=i ' 
To verify (14. lip we have to bound the growth of the values rfk := j^- The choice made in 
()4.9p implies that {'r]k)k>o obeys 770 < and 

iVkli + W < T^,^P^vPog{2,,), for k > 0. 

From this it is straightforward to verify inductively the existence of constants $b, B > 0$, which
only depend on $\varepsilon$, $\kappa$ and $\gamma$, such that



Hence, we may choose a suitable constant $C > 0$ such that for

\[ \delta \le \frac{\log\log N}{\log\big( \frac{\gamma}{1 - \gamma} \big)} - C \]

we have $\ell_\delta \ge 2$. To complete the proof, we note that

k=l k=l 

which implies convergence in (4.11) when $C$ is chosen large enough.